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Abstract 

We study the 2-dimensional vector packing problem, which is a generalization 

Q ■ of the classical bin packing problem where each item has 2 distinct weights 

q ■ and each bin has 2 corresponding capacities. The goal is to group items into 

minimum number of bins, without violating the bin capacity constraints. We 

propose an ©(n)-time approximation algorithm that is inspired by the 0{n 2 ) 

^ ■ algorithm proposed by Chang, Hwang, and Park. 
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1. Introduction 

X 
b 

In the classical bin packing problem, we are given a bin capacity, C, a set 
of items A = {cii, 02, ... , a n }, and we try to find a minimum number of bins 
B 1 , B 2 , . . . , B m , such that U™ Y Bi = A and ^2 a . eB . % < C for i — 1, . . . , m. 
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The vector packing problem is a generalization of this problem to multiple 
dimensions. In the d- dimensional vector packing problem, each item has d 
distinct weights and each bin has d corresponding capacities. Let a k denote 
the weight of the ith object in the kth dimension, and let C k denote the bin 
capacity in the fcth dimension. The goal is to group items into a minimum 
number of bins Bi, B 2 , . . . , B m such that 



This problem has been the subject of many research efforts. A survey of 
these efforts is provided by Lodi, Martello, and Vigo in 

In this paper, we study the 2-dimensional vector packing problem. Our 
motivation is allocating files to disks, hence the items are files and the two 
weights are the size and the load of the file. The load of a file refers to how 
much time a server is expected to spend with that file, and depends on access 
frequency, as well as file size. The constraints on the bins correspond to stor- 
age and service capacity of the disk. The sizes of the problem instances are 
extremely large, and excessive computational costs are prohibitive. There- 
fore we have to adopt efficient heuristics with small memory footprint and 
limited computational overheads. 

We propose an in-place 0(n) approximation algorithm that generates 
solutions that use no more than , where p is the ratio of the maximum 

1-/9 

a k n 

item weight to the corresponding bin capacity, i.e, p = max— p In [2j, 



an interesting general solution to the d-dimensional vector packing problem 
using linear programming relaxation is presented with a bound of In d + 1 
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from optimal. In our case, the cost of implementing an LP based algorithm 
is not practical due to the scale of applications we are consider inghere. Our 
work is closely related to the work of Chang, Hwang and Park [3j, and we 
improve the 0(n 2 ) complexity of their algorithm to Q(n). 

2. Notation 

Given a set of n items, let Si and U denote the two weights of the ith 
item. The problem we want to solve is: 

Given a list of tuples (si, (s 2 > h), ■ ■ ■ , (s n , l n ), and bounds Cs and Cl, 
find a minimum number sets B%, B 2 , . . . , Bk, so that each tuple is assigned to 
a set Bj, and 

\J Si < Cs and U < Cl for j = 1, ... k 



For simplicity, we will normalize Cs and Cl so they are both equal to 1 and 

the Si's and l^s are normalized accordingly so that they are fractions of Cs 

and Cl, and are all within the range [0,1]. 

We say an item is s-heavy if s, > /j and 1-heavy otherwise. We define 

p as the maximum value among all Sj and values (i.e., p = max{sj,/j : 

1 < i < n}). A bin Bi is s-complete if its cumulative s- weight, S, satisfies 

1 — p < S < 1; I- complete, if its I- weight, L satisfies 1 — p < L < 1; and 

complete if it is both s-complete and I- complete. We will prove that the 

number of bins used by the algorithm is within a factor of of the 

1R p 

optimum. Since for most applications p <^ 0.5, the algorithm of [3:] is better 
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for our purposes than that of |4j which gives a 2-optimal solution, but runs 
in 0{n\gn) time. 

3. The Algorithm 

In this section we present AlgorithmHJ which decreases the 0(n 2 ) runtime 
of the algorithm in |3| to Q(n). Let S and L denote the sum of s and l- 
weights of the items in the current bin. As mentioned earlier, the notion 
of bin completeness is central to the algorithm and refers to the fact that a 
current bin is sufficiently utilized and can be closed and a new bin started 
with a guarantee that the overall bound from optimality will not be violated. 
In this algorithm, each bin starts with the addition of the first unassigned 
item. At each iteration, the algorithm adds an s-heavy or an /-heavy item 
depending on whether L > S or S > L, respectively, This continues until 
the bin is s-complete (or 1-complete) or the size bound is violated. In {^J it 
is shown that once the size bound is violated, the bin can be reduced to be 
s-complete (or 1-complete), by removing a special item from the bin. A key 
contribution in this paper is how to locate that special item in Theta(l) time, 
granting an 0(n) time for the algorithm, as opposed to the 0(n 2 ) runtime of 
[3] . Exactly one of the functions Pack -Remaining S or Pack -Remaining _L 
is called after exiting the while loop when it is known that the remaining 
unassigned items are homogeneous such that they are either all s-heavy or 
all /-heavy. These functions perform a simple one dimensional bin packing. 
In P 'ack -Remaining S, the bins are packed based on the s values and each 
bin is packed until it is s-complete before starting a new bin. Similarly, in 
Pack -Remaining _L, packing is based on / values and a new bin is started 
when the current bin is /-complete. 

Another key contribution is the design of data structures that avoid any 
auxiliary storage. Our algorithm is an in-place algorithm, which is important 
for massive data sets, and vital for data base reorganization. The algorithm 
uses two pointers sp and Ip that point to the first unassigned item for which 
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Algorithm 1: Algorithm Pack_Disks 



1 Given an array F — ((si,li),..., (sn, In)), find D , D 1; . . . , D q such 
that A-i to A — 1 constitute the i-th bin on the permuted F array 

2 -S-^-Si; L ^ li, D 1; A 2; 

3 if S > L then last_s 1; else last_l 1; 

4 sp <r- find_next_s(l); Zp -f- find_next_l(l); 

5 while Ip < N and sp < N do 
if 5 > L then 

L L + l lp ; S <r- S + s ip ; 
if 5 > 1 then 

j swap(Zp, last_s); L ^- L- /i ast _ s ; S <- S - s last _ g ; 
else 

if sp < Ip then 

I swap(/p, Di); sp <— sp + 1; 

lastJ <r- Di Di 4- A + 1; 

/p -f- find_next_l(7p); 
else 

L i — L + l S p] S 4 — S + Z S p; 
if L > 1 then 

| swap(sp, last J); L <— L — Z lagt j; S ^ S - s last j; 
else 

if £p < sp then 
|_ swap(sp, Di); Ip <— lp+ 1; 

last_s <- A; Di<- Di + 1;; 

find_next_s(sp); 

if S>l-p and L>l-p and Di < N then 

L<-l D .; S <- s Di ; i <- i + 1; A «- A-i + 1; 
if S* > L then 

j lasts A; sp <h- find_next_s(sp); 
else 



I /astJ A; find_next_l(/p); 



30 if (sp < JV) then Pack_Remaining_S; 

31 if (Zp < A) then Pack_Remaining_L; 
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Si > U and li > Si, respectively. The function findjnexts(j) returns the 
smallest index % > j of an unassigned item such that Si > U and symetrically, 
findjnextJ(j) returns the smallest % > j such that li > Sj. The cumulative 
sum of Si and U values for the current bin are stored in S and L. The index 
of the last s-heavy item added to the current bin is stored in lasts, and the 
last /-heavy item is stored in lastJ. 

Lemma 1. If S>L and S+Si p >l, then S — L < si ast _ s — li as t.s, where lasts 
is the index of the last s-heavy item added to the bin. 

Proof. Condition S>L implies that at least one s-heavy item was added to 
the current bin, thus lasts has been initialized. Let S' and V be the sum 
of s- and 1-weights of the items added before last_s, and let S and L be the 
sum of s- and 1-weights of the items added after lasts. We know L' > S', 
since the algorithm chose to add an s-heavy item, and L > S, since we have 
been adding 1-heavy items after lasts. This gives us 

(S' + S) - (L' + I) < 

(S + S + Si as t_s) — (L + L + Zlast-s) — s last_s — ^last_s 

S — L < Slast_s — ^lastj 

Lemma 2. If S > L and S + si p > 1, then the current bin will be complete 
after removing lasts and adding Ip. 

Proof. This result is already proven in 

Lemma 3. If L > S and L + l sp > 1, then L — S < li as u — si as u, and the 
current bin will be complete after removing lastJ and adding sp. 

Proof. The proof is based on arguments in proofs of Lemma [1] and Lemma [2j 
The previous two lemmas form the algorithmic basis of our algorithm, in 
the following lemma we focus on the correctness of our data structures. 
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Lemma 4. After each iteration of the while loop, Ip and sp point to, re- 
spectively, an l-heavy and s-heavy item with the smallest index > Di. The 
pointers lastJ and last_s point to the last s- and l-heavy item in the current 
bin, respectively. 

Proof. We will only discuss the case S > L, since the other case is symmetric. 
Note that min{sp, lp} = Di. That is, either sp or Ip points to the first 
unassigned item. The execution of the algorithm depends on whether S + 
si p > 1 and whether sp < lp. If S + si p > 1, we want to add lp and remove 
last_s from the current bin. In this case if lp < sp (thus lp = Di), the 
algorithm moves last_s to the position Di, which subsequently is assigned as 
the first item of the next bin within the same iteration on line 23. Therefore, 
sp still points to the l-heavy item with the smallest index not currently 
assigned, and lp moves to the right item by a call to findjnextJ. If lp > sp, 
then the last_s item is moved in place of lp, which is ahead of sp. So once lp 
moves ahead by a find_next call it will find the l-heavy item with the smallest 
index not currently assigned. 

If S + sip > 1, we need to add lp to the current bin. If lp < sp (thus 
Di = lp), then incrementing Di, and then using findjnextJ will be sufficient, 
if sp < lp (thus Di = sp), then we need to put lp to replace sp. In this case 
incrementing, sp by 1 guarantees that it will be pointing to an s-heavy object 
is also the smallest unassigned index. 

It is easy to follow that updates on lastJ and last_s are done correctly. 

Lemma 5. Algorithm^ makes 2 scans and uses n + q data moves, where n 
is the number of items to be packed and q is the number of bins used. 

Proof. The algorithm uses two pointers lp and sp that read the values of the 
data items and they only move forward. At each step of the algorithm, we 
either swap an item to position Di or last J (lasts). Di can move up to n 
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(the number of items), and each swap with last J (lasts) means a bin being 
complete by Lemma [2] and Lemma [3] . 

Theorem 1. Algorithm^ runs in 0(n)-time to generate a solution with no 
more than + 1 bins, where C* is value of an optimal solution. 

Proof. 

Clearly C* > max{ s m h}- On the other hand, by Lemmas 

|5]and[31 the algorithm packs all subsets Di (except possibly for the last one) 
such that exactly one of the following 3 cases occurs: 

1. all subsets D^s are complete 

2. all subsets D^s are s- complete, one or more are not l-complete 

3. all subsets D^s are l-complete, one or more are not s-complete 

Under case 1), the theorem follows directly Under case 2), 

c PD <i + -^— V Sl <i + -^—c\ 

1 — p / — ' 1 — p 

An analogous argument also works under case 3) thus proving our bound. 
The linear runtime of the algorithm is an implication of Lemma [5j 

4. Conclusions 

We studied the 2-dimensional vector packing problem. We described an 
in-place, B(n)-time approximation algorithm that finds solutions within 
of an optimal, where p is maximum normalized item weight. Our algorithm 
also limits the number of item moves to at most n + k, where n is the 
number of items and k is the number of bins used. A simple generalization 
of our linear time algorithm to 3-dimensional vector packing can be shown 
with a bound of -^- p from optimal. This is done by first running the 2- 
dimensional solution on the first two dimensions of each item (ignoring the 
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third dimension) and then applying a one dimensional bin packing algorithm 
on the contents of each bin based only on the third dimension. It remains an 
open problem whether better bounds are possible with linear time algorithms 
where item weights satisfy size constraints. 
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